Picornaviruses (PV) and coronaviruses (CoV) are positive-stranded RNA
viruses that infect millions of people worldwide each year, resulting in
a wide range of clinical outcomes. As reported in this study, using
high-throughput screening of ~6800 small molecules, we identified
several novel inhibitors of SARS-CoV 3CLpro with IC50 values in the low
micromolar range. Interestingly, one of them inhibited both the 3Cpro of
PV and the 3CLpro of CoV almost equally. Using computer modeling, the
structural features of these compounds as individual and common protease
inhibitors were elucidated to enhance our knowledge for developing
anti-viral agents against PV and CoV.
1. Introduction


Picornaviruses (PV) are small nonenveloped RNA viruses with a single
strand of genomic RNA of 7500-8000 nucleotides [1]. The members of PV
include rhinoviruses (RV), enteroviruses (EV), coxsackieviruses (CV),
polioviruses, echoviruses, encephalomyocarditis viruses, meningitis
virus, foot and mouth viruses, hepatitis A virus, and others. Among
them, RV is the major cause of the common cold, whereas EV and CV
infection can cause hand, foot, and mouth diseases in humans and
animals. In severe cases, EV can damage the central nervous system,
leading to viral meningitis, encephalitis, and severe myocarditis, as
well as fatal pulmonary edema [2-5]. CV strain B is a major human
pathogen that causes meningitis and myocarditis leading to heart failure
in young adults and congestive heart failure [6]. In these PV, a
chymotrypsin-like protease (named 3Cpro) is required to process
polyproteins into mature proteins for viral replication, and it
therefore represents a promising anti-viral drug target [7].


* Corresponding authors. Address: Institute of Biological Chemistry, Academia 
Sinica, Taipei 11529, Taiwan, ROC. Fax: +886 2 2788 9759. 
E-mail addresses: ysjung@krict.re.kr (Y.-S. Jung), phliang@gate.sinica.edu.tw 
(P.-H. Liang). 


On the other hand, coronaviruses (CoV) are positive-stranded RNA viruses
with a larger genome of 27-32 kb, which typically cause respiratory and
enteric diseases, pneumonia, exacerbation of asthma, neurological
symptoms, and myocarditis in humans and domestic animals. An outbreak of
severe acute respiratory syndrome (SARS), caused by a novel human CoV,
spread from China to 29 countries in 2003, infecting a total of ~8000
people and killing ~800 patients [8]. SARS-CoV contains a 3C-like
protease (3CLpro) analogous to the 3Cpro of PV, responsible for
processing two overlapping polyproteins, pp1a (486 kDa) and pp1ab
(790 kDa). Other human CoV, including CoV-229E, CoV-OC43, CoV-HKU1, and
CoV-NL63, also require a 3CLpro for the maturation of viral proteins.

Several inhibitors have been developed against the 3Cpro of RV and EV
[9-12] and the 3CLpro of SARS-CoV [13-15]. However, these inhibitors
cannot be used interchangeably without modification. For example,
AG7088, a potent inhibitor of RV and other picornaviral 3Cpro [16],
failed to inhibit SARS-CoV 3CLpro [17]. Unlike the 3CLpro, which is
dimeric and in which each subunit is composed of three domains, the
3Cpro is a monomer with only the two catalytic domains. The
structure-based sequence alignment (Fig. 1) shows some sequence
differences, which may alter inhibitor specificity. In this study, we
performed high-throughput screening of a library of ~6800 compounds and
found five novel inhibitors of the SARS-CoV 3CLpro, four of which also
inhibited the 3CLpro of another human CoV, CoV-229E, but did not inhibit
the 3Cpro from RV14, CVB3, and EV71. One compound, however, was found to
inhibit these 3CLpro and 3Cpro almost equally. From computer modeling,
we rationalized the binding discrepancy of the inhibitors against these
proteases. This information is useful for further developing more potent
individual or common inhibitors of the 3Cpro and 3CLpro of PV and CoV
for anti-viral drug discovery.


2. Methods 
2.1. Expression and purification of the proteases 


Two types of proteases, 3CLpro from SARS-CoV and CoV-229E and 3Cpro from
CVB3, EV71, and RV14, were used to assay the inhibitors in this study.
The SARS-CoV 3CLpro and EV71 3Cpro were prepared as reported previously
[12,18]. To express the CVB3, RV14, and CoV-229E proteases, the genes
were cloned from viral cDNAs by polymerase chain reaction (PCR) as
reported elsewhere.


2.2. Primary screening 


For screening, 0.05 µM SARS 3CLpro, 6 µM fluorogenic substrate
Dabcyl-KTSAVLQSGFRKME-Edans, and 50 µM of each of approximately 6800
compounds provided by Korea Chemical Bank (Daejeon, Korea) were used.
The increase in fluorescence of the reactions in a buffer of 20 mM
Bis-Tris at pH 7.0 was monitored at 538 nm with excitation at 355 nm
using a fluorescence plate reader. The compounds that inhibited more
than 50% of the protease activity at 50 µM were selected for the next
assay run at 10 µM.
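
The hit-selection step described above can be sketched in a few lines of Python. This is a minimal illustration, not the analysis code used in the study: the function names, the use of fluorescence rates as the activity readout, and the example numbers are all assumptions; only the 50% inhibition cutoff comes from the text.

```python
def percent_inhibition(rate_with_compound, rate_control):
    """Percent loss of protease activity relative to the no-compound control."""
    if rate_control <= 0:
        raise ValueError("control rate must be positive")
    return 100.0 * (1.0 - rate_with_compound / rate_control)


def select_hits(rates, rate_control, cutoff=50.0):
    """Return the IDs of compounds whose inhibition exceeds the cutoff (percent)."""
    return sorted(
        cid for cid, rate in rates.items()
        if percent_inhibition(rate, rate_control) > cutoff
    )


# Hypothetical fluorescence rates (arbitrary units per minute) at 50 uM compound.
example_rates = {"cmpd_A": 12.0, "cmpd_B": 55.0, "cmpd_C": 49.0}
hits = select_hits(example_rates, rate_control=100.0)
print(hits)
```

The same function could be reused for the second pass at 10 µM by supplying the rates measured at that concentration.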


2.3. IC50 determination


The five hits that inhibited SARS-CoV 3CLpro at 10 µM were also
evaluated against CoV-229E 3CLpro, EV71 3Cpro, CVB3 3Cpro, and
RV14 3Cpro. In the assay solution, the activities of these proteases
(0.5 µM) with 10 µM fluorogenic substrate in buffers of
10 mM MES at pH 6.5 and 6.0 (the optimal pH for EV71 and
RV14 proteases, respectively) and 10 mM HEPES at pH 7.5 (for
CoV-229E and CVB3 proteases) were measured in the presence


Fig. 1. A structure-based sequence alignment of SARS-CoV 3CLpro, CoV-229E 3CLpro, CVB3 3Cpro, EV71 3Cpro, and RV14 3Cpro. The domains according to 3CLpro are shown above
the sequence and the secondary elements according to the 3Cpro structure are shown below it. Arrows indicate the essential catalytic amino acids His and Cys for 3CLpro and
3Cpro, and Glu (only for 3Cpro).
of various concentrations of the inhibitors to obtain the IC50
values.
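
Equation 1, which is used to fit the dose-response curves, is not reproduced in this text. The sketch below shows how such a fit could be done, assuming the simple one-site form A(I) = A(0) × (1 − [I]/([I] + IC50)). That form, the function names, and the synthetic data are assumptions made for illustration only. The fit is a plain golden-section search over log10(IC50), so the example needs nothing beyond the Python standard library.

```python
import math


def fractional_activity(conc, ic50):
    """Assumed one-site model: A(I)/A(0) = 1 - [I] / ([I] + IC50)."""
    return 1.0 - conc / (conc + ic50)


def sum_sq_error(log_ic50, concs, activities):
    """Sum of squared residuals for a trial value of log10(IC50)."""
    ic50 = 10.0 ** log_ic50
    return sum((a - fractional_activity(c, ic50)) ** 2
               for c, a in zip(concs, activities))


def fit_ic50(concs, activities, lo=-3.0, hi=3.0, tol=1e-8):
    """Least-squares IC50 by golden-section search over log10(IC50) in [lo, hi]."""
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    a, b = lo, hi
    c = b - inv_phi * (b - a)
    d = a + inv_phi * (b - a)
    while abs(b - a) > tol:
        if sum_sq_error(c, concs, activities) < sum_sq_error(d, concs, activities):
            b = d  # minimum lies in [a, d]
        else:
            a = c  # minimum lies in [c, b]
        c = b - inv_phi * (b - a)
        d = a + inv_phi * (b - a)
    return 10.0 ** ((a + b) / 2.0)


# Synthetic, noise-free data generated with IC50 = 5 uM (not measured values).
concs_um = [0.5, 1.0, 2.5, 5.0, 10.0, 25.0, 50.0]
activities = [fractional_activity(c, 5.0) for c in concs_um]
print(round(fit_ic50(concs_um, activities), 3))
```

With real plate-reader data, the activities would be the measured rates divided by the rate without inhibitor. A dedicated curve-fitting routine that also reports the standard error would be the more usual choice.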


2.4. Computer modeling of the inhibitors binding with the proteases 


For the modeling analysis, we used the crystal structure of SARS 3CLpro
in complex with a peptide inhibitor (PDB code 1UK4) [19], the structures
of CoV-229E 3CLpro and CVB3 3Cpro solved by us, and the structural model
of EV71 3Cpro constructed from the structure of RV 3Cpro (PDB code 1CQQ)
[20]. Docking was performed using an automated ligand-docking subprogram
of Discovery Studio Modeling 1.2 SBD (Accelrys Inc., San Diego, CA),
with a set of parameters chosen to control the precise operation of the
genetic algorithm. Docking runs were carried out using the standard
default settings: a “grid resolution” of 5 Å, a “site opening” of 12 Å,
and “binding site” selected for defining the active site cavity.


3. Results 
3.1. Screening of the protease inhibitors 

We first screened a library of ~6800 compounds for inhibition of
SARS-CoV 3CLpro. In the primary screening, 66 compounds showed more
than 50% inhibition of the enzyme activity at 50 µM. We further tested
their inhibitory activities at 10 µM, and five of them (21155, 22723,
27548, 43146, and 48511) showed IC50 values of approximately 10 µM or
lower.


According to their dose-response curves as shown in Fig. 2A-E, 
the five hits 21155, 22723, 27548, 43146, and 48511 displayed 
Fig. 2. Dose-response curves for the five hits against SARS-CoV 3CLpro from the screening. IC50 values were determined from the curves using equation 1. These were (A)
7.2 ± 0.7 µM (21155), (B) 10.6 ± 1.3 µM (22723), (C) 7.0 ± 0.8 µM (27548), (D) 3.3 ± 0.2 µM (48511), and (E) 8.1 ± 0.9 µM (43146). The structures and activities of these
inhibitors are summarized in Table 1.


552 C.-J. Kuo et al./FEBS Letters 583 (2009) 549-555 


Table 1
Summary of IC50 values (µM) of the five hits with SARS-CoV 3CLpro and other 3C(L) proteases.
Compound ID    SARS 3CL
21155          7.2 ± 0.7
22723          10.6 ± 1.3
27548          7.0 ± 0.8
48511          3.3 ± 0.2
43146          8.1 ± 0.9


IC50 values of 7.2 ± 0.7, 10.6 ± 1.3, 7.0 ± 0.8, 8.1 ± 0.9, and
3.3 ± 0.2 µM, respectively, against the SARS 3CLpro. Similar
inhibition results were observed for the 3CLpro of CoV-229E (data
summarized in Table 1), but not for the 3Cpro. In contrast, 43146
inhibited both 3Cpro and 3CLpro, with IC50 values of 10.3 ± 1.1,
5.4 ± 0.2, 3.3 ± 0.3, and 5.2 ± 0.6 µM against CoV-229E 3CLpro,
CVB3 3Cpro, EV71 3Cpro, and RV14 3Cpro, respectively (Fig. 3A-D
and summarized in Table 1). This compound contains a dihydropyrazole
ring with three substituents: two phenyl groups and a
lengthy N-butyl-benzimidazolylamino-toluene.


3.2. Inhibition potencies of the 43146 analogues 


Since 43146 inhibited both 3CLpro and 3Cpro, its analogues 45240, 68638,
55688, and 55585, obtained from another compound library, were
evaluated. As shown in Table 2, all of them showed good potencies
against the five proteases. The most potent compound was 45240, and its
IC50 values against the 3C(L) proteases were measured to be
2.5 ± 0.2 µM (SARS-CoV 3CLpro), 2.6 ± 0.4 µM (CoV-229E 3CLpro),
1.2 ± 0.3 µM (CVB3 3Cpro), 0.5 ± 0.1 µM (EV71 3Cpro), and
1.7 ± 0.1 µM (RV14 3Cpro) (Table 2). This compound contains four rings,
three phenyl groups and one imidazole, surrounding a central
dihydropyrazole ring, without the lengthy side chain seen in 43146.
Compound 68638, with a benzylcyclohexane ring fused with the
dihydropyrazole ring and acetyl and iodobenzyl groups attached to the
central ring, showed less inhibition against both 3Cpro and 3CLpro
(Table 2). The other two compounds, 55688 and 55585, with shorter side
chains attached

Table 1 (continued)
Compound ID    229E 3CL      CVB3 3C      EV71 3C      RV14 3C
21155          5.6 ± 1.0     >50          >50          >50
22723          12.4 ± 0.8    >50          >50          >50
27548          6.6 ± 0.3     >50          >50          >50
48511          1.8 ± 0.7     >50          >50          >50
43146          10.3 ± 1.1    5.4 ± 0.2    3.3 ± 0.3    5.2 ± 0.6


to the benzimidazolyl group showed similar inhibitory activities 
as compared to 43146 (Table 2). 
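
To put the comparison between 45240 and 43146 in numbers, the short Python sketch below divides the IC50 of 43146 by that of 45240 for each protease. The IC50 values are the mean values reported in the text, with uncertainties ignored. The variable names and the fold-change measure are illustrative choices, not part of the original analysis.

```python
# Reported mean IC50 values in uM (uncertainties omitted).
ic50_43146 = {"SARS-CoV 3CLpro": 8.1, "CoV-229E 3CLpro": 10.3,
              "CVB3 3Cpro": 5.4, "EV71 3Cpro": 3.3, "RV14 3Cpro": 5.2}
ic50_45240 = {"SARS-CoV 3CLpro": 2.5, "CoV-229E 3CLpro": 2.6,
              "CVB3 3Cpro": 1.2, "EV71 3Cpro": 0.5, "RV14 3Cpro": 1.7}


def fold_improvement(reference, analogue):
    """Ratio of reference IC50 to analogue IC50; above 1 means the analogue is more potent."""
    return {name: reference[name] / analogue[name] for name in reference}


folds = fold_improvement(ic50_43146, ic50_45240)
for protease, fold in folds.items():
    print(f"{protease}: {fold:.1f}-fold")
```

On these mean values, 45240 is more potent than 43146 against all five proteases, with the largest ratio for EV71 3Cpro.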


3.3. Computer modeling of 21155, 22723, 27548, and 48511 binding to 
the proteases 


These inhibitors are competitive with respect to the substrate (data not
shown), indicating that they bind in the active site. To rationalize the
binding discrepancy of these inhibitors against these proteases, their
binding modes with SARS-CoV 3CLpro and the four other proteases were
modeled, and some of them are shown in Fig. 4. The first four inhibitors
of SARS-CoV 3CLpro are more rigid because the thiazolopyridine in 21155,
the dichlorobenzoquinolinone in 22723, the isoindoledione in 27548, and
the oxazole in 48511 adopt planar structures, and the three substituents
of the oxazole ring in 48511 are fixed in one conformer by the
1,2-steric interaction between the acetate group and the N-aryl imino
group as well as the biaryl interaction between the phenyl and oxazole,
which prohibit their free rotation. All these compounds can be
considered as two rigid aromatic moieties connected by a small linker.
Based on the computer modeling, each of these aromatic moieties is bound
to the S1 or S2 site of the SARS protease by forming H-bonds and
hydrophobic interactions (Fig. 4A-D). As shown in the computer modeling,
the Glu166 side chain of SARS 3CLpro forms H-bonds with these four
inhibitors. However, the corresponding amino acid residue in 3Cpro is
Gly164, which lacks a side chain to form an H-bond with any of these
compounds (also see Fig. 4F), leading to loss of inhibition. In
addition, the 3Cpro have a more open but shallow S2
Fig. 3. Dose-response curves for 43146 against 229E 3CLpro, CVB3 3Cpro, EV71 3Cpro, and RV14 3Cpro. IC50 values were determined from the curves using equation 1. These
were (A) 10.3 ± 1.1 µM (229E 3CLpro), (B) 5.4 ± 0.2 µM (CVB3 3Cpro), (C) 3.3 ± 0.3 µM (EV71 3Cpro), and (D) 5.2 ± 0.6 µM (RV14 3Cpro).


Table 2 
IC50 values (µM) of compound 43146 analogs with SARS-CoV 3CLpro and other 3C(L) proteases.


Compound ID Structure SARS 3CL 229E 3CL CVB3 3C EV71 3C RV14 3C 
Fig. 4. Computer modeling of the binding modes of the inhibitors in the active site of the SARS 3CLpro. The more rigid moieties of the inhibitors 21155, 22723, 27548, and 48511
are probably bound to the S1 and S2 sites as shown in (A), (B), (C), and (D), respectively. (E) 43146 binds SARS-CoV 3CLpro differently, with the biphenyl 4,5-dihydro-1H-pyrazole
moiety anchored at the S1' and S2 sites and the rest of the molecule at the S3 and the following sites. The hydrogen bond interactions are represented by yellow dotted lines.


site (due to its partial blockage by Leu127) than 3CLpro, according to
the crystal structures of RV 3Cpro [20] and CVB3 3Cpro (Lee et al.,
unpublished results) (also see Fig. 4F). Thus, 3Cpro cannot hold these
compounds tightly.


3.4. Binding modes of 43146 and its analogues to the proteases


In contrast, the compound 43146 is more flexible, because the
dihydropyrazole is not planar and the phenyl group is linked to the
sp3-hybridized carbon of the dihydropyrazole ring, so it is free to
rotate. Unlike the binding modes of the other four inhibitors, the
diphenyl 4,5-dihydro-1H-pyrazole moiety of 43146 fits well at the S1'
and S2 sites in the SARS 3CLpro (Fig. 4E), with the rest of the molecule
at the S3 site and beyond. With this binding mode, the compound was
predicted to also bind well in the 3Cpro, consistent with the inhibition
data. In fact, RV 3Cpro prefers a phenyl group at the S2 site, as
evidenced by its strong inhibition by AG7088, which has a
P2-fluorophenylalanine. Thus, computer modeling rationalizes why only
43146 among the five hits can inhibit the three 3Cpro in addition to the
3CLpro.

The analogues of 43146, including 45240, 68638, 55688, and 55585, bind
in the 3Cpro and 3CLpro active sites with modes similar to that of 43146
(data not shown). Compared to 43146, 55688 and 55585 have only minor
structural differences, with shorter alkyl groups attached to the
benzimidazole ring, so they showed similar inhibition against the
proteases. The fused ring system and the phenyl group in 68638 may also
span from the S1' to S2 sites in both kinds of proteases, yielding
similar inhibition. However, 45240 showed significantly better
inhibition against the 3Cpro than 43146. Apparently, the lengthy side
chain attached to the phenyl group in 43146 did not provide additional
interaction with the protease, consistent with the binding mode shown in
Fig. 4E. Instead, the additional interaction is provided by the pyridine
ring bound near the more open S1' site in 3Cpro.


4. Discussion 


AG7088 is the best inhibitor identified so far for 3Cpro; it inhibits
not only the 3Cpro from RV, but also those from CV and EV [16]. However,
it did not inhibit 3CLpro from SARS-CoV [17]. This may be partially due
to the blockage of its P1-lactam ring by the relatively larger Glu166
side chain, and also because the S2 site of 3CLpro is narrower, although
deeper. Therefore, when the P2-phenylalanine was changed to non-planar
leucine or cyclohexane without changing the P1-lactam, the resulting
compounds became good inhibitors of SARS 3CLpro [21]. Unlike AG7088,
which is a ketomethyl isostere of a tripeptide-conjugated ester,
compound 43146 is not peptide-like. From the random screening reported
in this study, we have found a starting point toward the development of
non-peptide multiple-function inhibitors against CoV and PV. With
further modification of these individual and common inhibitors of the
viral proteases, we hope to find a solution for the possible
reoccurrence of SARS and other diseases caused by viruses with the 3Cpro
and 3CLpro.